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Abstract 

The  prevalent  method  of  synthesizing  nonlinear 
missile  autopilots  is  by  gain-scheduling  linear  designs. 
Although  this  approach  has  proven  successful  in 
numerous  applications,  the  desire  to  continually 
improve  performance  without  incurring  additional  cost 
suggests  the  need  for  a  new  design  paradigm.  An 
opportunity  to  address  this  need  has  been  identified 
from  previous  research  which  employed  neural  network 
technology  to  augment  approximate  dynamic  inversion 
controllers.  In  the  one  architecture  a  neural  network 
adaptively  cancels  linearization  errors  through  on-line 
learning,  which  may  be  accomplished  by  a  weight 
update  rule  derived  from  Lyapunov  theory.  This 
effectively  guarantees  stability  of  the  closed-loop 
system.  This  paper  concerns  a  similar  implementation  in 
which  neural  networks  function  instead  to  improve 
command  tracking  of  gain-scheduled  control  laws.  This 
theoretical  development  is  then  specialized  to  the 
problem  of  synthesizing  a  bank-to-turn  autopilot  for  an 
agile  anti-air  missile.  Finally,  the  resulting  hybrid 
control  law  is  demonstrated  in  a  nonlinear  simulation 
and  its  performance  is  evaluated  relative  to  that  of  the 
unaugmented  gain-scheduled  autopilot. 

Introduction 

Historically,  the  field  of  missile  autopilot  design 
has  been  dominated  by  the  technique  of  gain¬ 
scheduling.  This  approach,  documented  in  Ref.  1,  is 
based  on  linearizing  the  vehicle’s  nonlinear  equations  of 
motion  about  several  operating  conditions.  Linear 
controllers  are  then  designed  for  each  of  these  points  in 
the  flight  envelope,  and  their  gains  are  scheduled  as  a 
function  of  flight  condition  to  yield  a  full-envelope 
nonlinear  autopilot.  Although  this  approach  has  proven 
successful  in  numerous  practical  applications,  it  does 
have  certain  disadvantages.  Theoretical  investigations 

‘Aerospace  Engineer.  Member,  AIAA. 

APPROVED  FOR  PUBLIC  RELEASE:  DISTRIBUTION  IS 
UNLIMITED. 


Donald  T.  Stansbery* 

QuesTech 

214  Government  Street 
Niceville,  Florida,  USA  32578 

of  gain-scheduling  such  as  the  one  in  Ref.  2  have  shown 
that  controller  gains  should  be  scheduled  as  functions  of 
slowly  varying  quantities  if  favorable  performance  and 
robustness  properties  of  the  individual  point  designs  are 
to  be  preserved  in  the  overall  scheduled 
implementation.  Control  designers  are  increasingly 
faced  with  unprecedented  performance  requirements 
which  necessitate  a  large  number  of  point  designs  to 
form  an  adequate  gain  schedule.  A  large  gain  schedule 
not  only  requires  more  time  and  labor  to  design,  it  also 
creates  the  need  for  more  wind-tunnel  testing  and 
additional  computation  and  storage  capacity  in  the  flight 
control  processor. 

A  potential  alternative  to  the  present  trend  toward 
more  complicated  gain-scheduled  designs  may  be  found 
in  previous  research  using  neural  network  technology  to 
augment  approximate  dynamic  inversion  controllers. 
Refs  3  and  4  are  among  the  examples  of  this  approach 
to  nonlinear  control  design.  The  development  to  be 
presented  in  this  paper,  however,  is  based  on  the 
architecture  of  Ref.  5,  in  which  a  neural  network 
adaptively  cancels  linearization  errors  through  on-line 
learning.  Said  learning  is  accomplished  by  a  weight 
update  rule  derived  from  Lyapunov  theory, 
guaranteeing  stability  of  the  closed-loop  system. 

This  control  design  methodology  was  shown  to 
perform  exceptionally  in  nonlinear  simulation  studies  of 
fixed-wing  and  rotary- wing  aircraft  in  Refs.  5  and  6, 
respectively.  In  Ref.  7,  the  same  technique  was  used 
with  minor  modifications  to  design  a  nonlinear  bank-to- 
turn  autopilot  for  the  agile  missile  described  in  Ref.  8. 
Simulation  studies  of  high  angle-of-attack  maneuvers 
indicated  that  the  neural  network-based  adaptive 
nonlinear  controller  achieved  performance  comparable 
to  that  of  a  gain-scheduled  autopilot  with  reduced 
development  effort.  This  led  to  further  research 
centered  on  more  complicated  multi-layer  neural 
network  architectures  in  Ref.  9  and  robustness  to  linear 
unmodeled  dynamics  in  Ref.  10.  This  paper  addresses 
the  question  of  whether  the  same  neural  network 
technology  could  be  used  to  augment  an  existing  gain- 
scheduled  autopilot  rather  than  an  approximate 
nonlinear  design.  The  preliminary  Results  presented  in 
Ref.  1 1  indicated  that  by  starting  with  a  gain-scheduled 
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design  and  introducing  an  adaptive  neural  network  one 
may  be  able  not  only  to  reduce  errors  associated  with 
operating  at  conditions  between  design  points,  but  also 
enlarge  the  flight  envelope  beyond  what  is  currently 
covered  by  the  gain  schedule.  While  the  development  in 
Ref.  11  considered  only  the  simple  case  of  neural 
networks  with  no  hidden  layers,  the  current 
investigation  employs  the  more  powerful  single-hidden- 
layer  architecture  of  Ref.  9. 

This  paper  begins  by  presenting  a  general 
architecture  for  integrating  a  gain-scheduled  control  law 
with  an  adaptive  neural  network  with  the  objective  of 
improving  command  following.  Next,  this  methodology 
is  applied  in  the  specific  context  of  a  missile  autopilot 
design  problem.  An  existing  gain-scheduled  autopilot  is 
augmented  using  neural  networks,  and  nonlinear 
simulation  results  are  illustrate  the  effectiveness  of  the 
resulting  design.  Finally,  conclusions  and  directions  for 
future  research  are  presented. 


Control  Design  Methodology 

The  following  development  concerns  a 
modification  of  the  technique  presented  in  Ref.  1 1  for 
augmenting  gain-scheduled  controllers  using  neural 
networks.  The  feature  which  distinguishes  this  result 
from  the  previous  efforts  is  that  it  is  admits  the 
important  class  of  neural  networks  with  one  hidden 
layer  composed  of  sigmoidal  neurons. 

Problem  Formulation 


Although  the  development  may  readily  be 
generalized  to  the  multi-input,  multi-output  (MIMO) 
case,  we  will  restrict  our  attention  to  single-input, 
single-output  (SISO)  case  for  simplicity  of  presentation. 
Consider  a  second-order  SISO  nonlinear  system: 

x  =  f(x,x)+g(x,x)i  (1) 

where  x(t),x(t)e9I  are  the  state  variables,  u(t)e  91  is 
the  control  variable,  and  f  and  g  are  sufficiently  smooth 
nonlinear  mappings  from  91x91  into  91 ,  Further, 
suppose  that  the  desired  response  of  the  closed-loop 
system  is  modeled  by  a  linear  second-order  transfer 
function  Gm(s): 


Gm(s)  = 


co 


2 

n 


s2  +  2^on+co~ 


(2) 


where  0)n  and  denote  the  natural  frequency  and 

damping  ratio,  respectively,  of  the  desired  response. 
Here,  xm  represents  the  state  of  the  desired  response 
model  and  r  denotes  an  externally  applied  reference 
command.  Note  also  that  Eq.  (2)  may  be  expressed  in 
differential  equation  form  as  follows: 


=-®n*m-2i;0)nkm+G^r 

Gain-Scheduled  Linear  Control  Design 

In  an  effort  to  achieve  this  desired  response,  a  gain- 
scheduled  controller  may  be  synthesized  by  linearizing 
Eq.  (1)  about  several  operating  points  and  employing 
any  suitable  linear  control  scheme.  The  details  of  this 
design  will  be  omitted  here,  as  we  elect  instead  to  focus 
on  the  use  of  neural  networks  and  adaptive  control 
theory  to  augment  an  existing  controller.  In  the  interest 
of  simplicity,  the  current  development  is  based  on  a 
gain-scheduled  linear  controller  with  the  simple 
proportional-plus-derivative  (PD)  form: 

“gs  =kp(r-x)+kd(f-x)  (4) 

where  the  gains  kp  >  0  and  kd  >  0  are  functions  of  the 
state  x.  In  practice,  these  gains  may  depend  additionally 
on  other  system  parameters,  and  integral  action  may 
also  be  included. 

Assume  that  applying  the  control  in  Eq.  (4)  to 
the  dynamics  in  Eq.  (1)  yields  the  closed-loop  system: 
x=f(x,x)+g(x,x)iQS 

=  -0)^x-2^coi.x  +  a)^xt +A(x,x,r,r)  (5) 

=  fm(x,x,r)+A(x,x,r,f) 

where  an  error  function  (A)  has  been  defined  by 
A(x,x,r,r)=f(x,x)+g(x,x)ucs -fm(x,x,r).  This  error 
represents  the  deviation  of  the  closed-loop  dynamics 
from  the  desired  response  model.  It  is  important  to  note 
that  Eq.  (5)  implicitly  assumes  that  the  closed-loop 
system  is  in  fact  second-order.  While  this  condition  may 
be  somewhat  restrictive,  it  is  unavoidable  in  the  current 
implementation.  Further  study  is  necessary  to  precisely 
characterize  the  implications  of  this  assumption. 

Neural  Network-Based  Adaptive  Control 

We  are  now  prepared  to  augment  the  system  in  an 
effort  to  eliminate  the  effect  of  the  error  function 
A(x,  x,r).  To  accomplish  this,  we  inject  an  additional 
control  signal  (-uad)  at  the  plant  input.  This  results  in  the 
modified  system  described  by 

x=fm(x,x,r)+A(x,x,r,r)-g(x,x)uad  (6) 

Figure  1  depicts  the  augmented  system  of  Eq.  (6). 


Figure  1:  Neural  Network  Augmentation 
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Defining  the  error  variable  x  =  xm-x,  we  may 
subtract  Eq.  (6)  from  Eq.  (3)  to  obtain  the  following 
second-order  equation  for  the  error  dynamics: 

x  =  -(0^x-2^(0nx-i-g(x,x)jiiJ  -A(x,x,r,r)  (7) 

Further,  define  a  pseudo-control  v  using  the  following 
control  transformation: 

v^x.x)^  <=>  uad  =g-'(x,x)v  (8) 

where  g(x,x)  is  a  nonsingular  approximation  of  the 
unknown  control  influence  matrix  g(x,x).  We  may 
then  rewrite  Eq.  (7)  and  substitute  to  obtain 
x  =  -co;x  -  2;tonx  +  g(x,  x)^  +  [g(x,  x)-g(x,x)]uaJ 
-A(x,x,r,r) 

=  -co^x-2^conx  + v  (9) 

-{^(x,x,r,r)-[g(x,x)-g(x,x)^',(x,x)v} 

=  -w*  x  -  2^ton  x  +  v  -  A’(x,  x,  r,  r ,  v) 

where  all  uncertain  nonlinear  terms  have  been  collected 
into  one  uncertainty,  denoted  by  A' .  Note  that  if 
g(x,x)=g(x,x)  and  in  particular  if  g(x,x)=g(x,x)=  1 , 
then  A'  =A .  It  will  later  be  useful  to  define  the  pseudo¬ 
control  as  the  sum  of  two  contributions  as  follows: 

v  =  vad-v  (10) 

where  vad  is  an  adaptive  term  to  be  computed  using  a 
neural  network,  and  v  will  be  designed  to  account  for 
nonlinear  parameterization  effects. 

A  minimal  state-space  realization  of  the 
augmented  error  system  in  Eq.  (9)  is  then  given  by 

z.  =  Az  +  bfv^  -  v-A'(x,x,r,r,v)]  (11) 

where  z  denotes  the  state  vector,  A  is  Hurwitz,  and 
bT  =  [0  l].  This  is  similar  to  the  form  of  the  dynamics 
considered  in  Ref.  9.  Note  that,  in  contrast  with  the 
original  gain-scheduled  controller,  it  will  be  necessary 
to  integrate  the  state  xm  of  the  desired  model  online. 

We  now  describe  briefly  the  neural  network 
architecture  used  to  parameterize  A'  and  compute  vad. 
The  neural  network  topology,  depicted  in  Figure  2,  is 
representative  of  a  common  class  of  feedforward 
networks  known  as  multilayer  perceptrons.  Because  of 
their  universal  approximation  capability,  we  will  restrict 
attention  to  networks  with  a  single  hidden-layer  of 
sigmoidal  neurons  (defined  below).  In  Figure  2, 
xe9IN’  denotes  the  neural  network  input  vector, 
V  e  9!N3*N|  denotes  the  matrix  of  interconnection 
weights  between  the  input  layer  and  the  hidden  layer,  N2 
is  the  number  of  hidden-layer  neurons,  w  e  9^N2  is  the 
vector  of  interconnection  weights  between  the  hidden 
and  output  layers,  and  y  represents  the  neural  network 


Figure  2:  Neural  Network  with  One  Hidden  Layer 


output.  The  output  of  the  particular  multilayer  network 
shown  in  Figure  2  is  written  mathematically  as 


where  a  is  the  hidden-layer  activation  function,  Vy  are 
the  first-to-second  layer  interconnection  weights,  and  Wj 
are  the  second-to-third  layer  interconnection  weights. 
The  bias  terms  0W  and  9vi  represent  thresholds.  This 
architecture  has  N,  inputs  and  N2  hidden-layer  neurons. 
The  form  of  the  hidden-layer  activation  function  is  a 
design  parameter,  but  we  will  consider  only  the 
sigmoidal  functions  with  activation  potential  a: 


o(z)  =  - 


l  +  e_az 

We  may  express  Eq.  (12)  in  matrix  form  as 
y  =  wTa(vTx) 


(13) 

(14) 


where  the  thresholds  are  incorporated  into  the  weight 
matrices  as  follows: 


Here,  we  have  defined 

yT  =  iy,  y2  •••  yNJ 

xT=[l  X,  X,  xj 

o(z)=[l  z,  z2  •••] 

A  neural  network  of  the  type  described  above  is 
capable  of  approximating  any  smooth  function  to  any 
desired  accuracy,  provided  the  number  of  hidden-layer 
neurons  N2  is  sufficiently  large.  This  implies  that  for 
continuous  A'(x,x,r,r,v)  and  any  En>0  there  exists  a 


finite  N2  and  “ideal”  weight  matrices  W  and  V  such  that 

A'  =  wTa(vTx)+e(x)  (15) 

with  ||e|  <  eN  .  Given  the  parameterization  of  A' 

described  by  Eq.  (15),  an  adaptive  neural  network  of  the 
same  structure  is  therefore  used  to  compute  vad. 
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vaJ  =  wTo(vTx)  (16) 

In  this  expression,  the  network  weights  w(t)  and  V(t) 
are  adjustable  parameters  which  will  be  updated  such 
that  vad  approximately  cancels  the  effect  of  A  . 

An  adaptive  scheme  will  be  employed  to  compute 
the  weight  matrices  w  and  V  on-line  so  that  the 
network  requires  no  off-line  learning  phase.  The 
Lyapunov  arguments  used  to  derive  stable  learning  rules 
also  yield  the  required  expression  for  the  pseudo- 
control  term  v.  First,  assume  that  the  ideal  weight 
matrices  are  bounded  in  the  sense  that  ||w|j<w  and 

|V|If  —  V  ,  where  ||  ■  ||  p  denotes  the  Frobenius  norm, 

||A||p  =tr|ATA|.  When  not  specified  otherwise,  ||  •  || 

indicates  the  Euclidean  norm.  Introducing  more 
compact  notation,  we  may  also  write 

Z  =  [w  V]  with  ||Z||F  <  Z  (17) 

The  reference  command  and  its  first  time-derivative  are 
also  assumed  to  be  norm-bounded. 


Observing  the  functional  dependencies  in 
Eq.’s(ll)  above  and  (26)  to  be  presented  later,  we 
choose  the  network  input 


xT  =  1  ZT  r  f  Vad  zjl 

If 

(18) 

which  is  bounded  as  follows: 

x  <c,  +c,  z  +c,  Z  ;c(  >0 

(19) 

A  key  result  which  facilitates  the  extension  of 
theory  previously  developed  for  simple  linear-in¬ 
parameters  neural  networks  to  more  complicated  neural 
networks  with  one  hidden  layer  has  been  described  in 
Ref.  8.  It  involves  the  use  of  a  Taylor  series  expansion 
of  the  hidden-layer  output.  First,  define  the  error 
variables  w  =  w-w,  V  =  V - V ,  and  Z  =  Z - Z . 
Also,  define  the  hidden-layer  output  error  as  follows: 

6  =  d-o  =  o(vTx)-  o(vT  x)  (20) 

The  Taylor  series  expansion  of  o  about  6  may  then  be 
written  as 

o(vT  x)  =  o(vT  x)-  o'(vT  x)vT  x  +  0(VT  x)2  (2 1 ) 

where  o'(z)=  do(z)/dz[  .  and  0(-)2  denotes  those  terms 

with  order  greater  than  one.  These  higher-order  terms 
satisfy  Eq.  (22): 

o(vTxf  =^(vTx)-o(vTx)]+a'(vTx)vTx 


c,  >  0 

where  the  inequality  follows  from  the  definition  of  the 
sigmoid  function  and  properties  of  vector  norms. 


In  preparation  for  the  Lyapunov  analysis,  rewrite 
the  error  dynamics  as  follows: 

z  =  Az+  wT6- wTo-e  — v  ^ 

=  Az  +  wT6- wT6+ wT6-e- v 
Recalling  Eq.  (20),  we  may  also  write 

z  =  Az  +  wT(o-6,Vxj+wT6,VTx  +  (d-v)  (24) 

where  6'  =  o'^VTx^  and  any  extraneous  terms  have 
been  collected  into  the  disturbance  signal,  d. 

d  =  wT6'VTx  +  wTo(vTx]f  - e 

=»M£C„+C,||zj|p  +c!||zJfH+c,[z||’  (25) 

Ck  >0 

The  norm-bound  on  d  follows  from  Eq.’s  (19)  and  (22) 
along  with  the  boundedness  of  e  and  6'  by  certain 
norm  inequalities. 

For  the  stability  analysis,  consider  a  typical  choice 
of  Lyapunov  function  candidate. 

V,  =  —  zTz+— wTw  +  —  tr(vTr~'v)  (26) 

2  2y  2  x  ' 

The  Lyapunov  analysis  is  similar  to  that  of  Ref.  9,  with 
modifications  to  account  for  the  structure  of  the  error 
dynamics  in  Eq  (11).  Based  on  this  development, 
learning  rules  for  the  weight  matrices  are  chosen  as 
follows: 

^  =  ^  =  -rv  [x (zT w To  )+  A|z|| V ]  (27) 

w  =  w  =  -y[(o  -  6'VTx)?;T  +  X||z|jw] 

Note  that  the  X  terms  in  Eq.  (27)  correspond  to  the 
e-modification  found  in  adaptive  control  literature. 
These  terms  provide  additional  damping  which  helps  to 
contain  the  growth  of  the  parameter  estimates.  In 
ad.1''  ■-  the  pseudo-control  term  v  in  Eq.  (10)  is 
defined  as 

v  =  Kz(|z||F  +  z)bTz  (28) 

where  the  gain  Kz  must  be  chosen  such  that 

Kz>C2  (29) 

in  order  to  effectively  dominate  nonlinear  phenomena 
associated  with  parameter  errors. 

We  then  conclude  that  the  trajectories  of  the 
closed-loop  error  system  are  uniformly  ultimately 
bounded  provided  that,  in  addition  to  X  >  C3 ,  either  of 
the  following  two  inequalities  hold: 


where  C4  =  (A-Z  +  C^/^-Cj)  and  o(a)  is  the 
minimum  singular  value  of  A. 
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In  this  section,  the  neural  network-based  adaptive 
control  methodology  described  above  is  used  to 
augment  an  existing  gain-scheduled  linear  autopilot  for 
the  agile  anti-air  missile  described  in  Ref.  8.  Studies  of 
high  angle-of-attack  maneuvering  have  determined  that 
an  appropriate  strategy  for  this  vehicle  is  to  design  a 
bank-to-turn  autopilot  which  tracks  external  guidance 
commands  in  angle-of-atlack  while  maintaining  near¬ 
zero  sideslip.  There  are  several  possible  choices  for  the 
roll-channel  command,  including  aerodynamic  bank 
angle  (p),  body  roll  angle  (<(>),  body  roll  rate  (p),  and 
stability-axis  roll  rate  (ps=  pcosa  +  rsina).  Whenever 
necessary,  the  following  discussion  assumes  that  the 
derivatives  of  external  commands  are  available  as 
outputs  of  a  command  rate  limiter  or  pre-filter. 


Nonlinear  Dynamics  &  Gain-Scheduled  Design 


The  motion  of  a  symmetric  missile  about  its 
velocity  vector  may  be  described  by  the  following 
equations. 

V  =  (X  +  gx  +Tx)cosoecosP  +  (Y  +  gy  +  Ty)sinP 
+  (Z  +  g,  +Tz)sinacosP 

“  =  —  1  -ra  [(Z  +  g,  +  T,  )cos a  -  (X  +  g x  +T  )sina] 

Vc°sp  (31) 

+  q  -  (r  sin  a  +  p  cos  a )tan  P 

P  =  ~  [(x  +  g  s  +  T>  )cos  a  sin  P  +  (Y  +  g  y  +  T(  )cos  P 

+  (Z  +  gz+T)  sin  asin  p]+ p  sin  a  -  r  cos  a 
where  V,  a,  and  P  denote  airspeed,  angle-of-attack,  and 
sideslip  angle  while  p,  q,  and  r  are  the  body-axis 
angular  rates.  The  quantities  X,  Y  and  Z  are  the  body- 
axis  components  of  the  acceleration  due  to  aerodynamic 
forces.  Similarly,  gxyz  and  Txyz  denote  body-axis 
components  of  acceleration  associated  with 
gravitational  effects  and  thrust,  respectively.  The 
moment  equations  have  the  form 


L 


where  L,  M,  and  N  represent  aerodynamic  moments 
about  the  body  axes.  Finally,  Ixx  and  Iyy  are  rolling  and 
pitching  moments  of  inertia,  respectively,  and  are 
assumed  for  simplicity  to  have  constant  or  slowly 
varying  values. 


We  will  now  derive  simplified  models  of  Eq’s.  (31) 
and  (32)  which  will  be  used  for  purposes  of  linear 
autopilot  design.  First  we  neglect  gravitational 
acceleration,  since  it  is  divided  by  V.  Zeroing  the  lateral 
dynamics  and  linearizing  the  longitudinal  dynamics 
about  a  particular  angle-of-attack  oto  results  in 

a  =— (q  +  Zaa  +  Z6e8e  -Tx  sina0  +  Tzcosa0) 

\ ,  ,  <33> 

q=T-(Maa+Mqq+MsA+Mx) 

lyy 

where  5e  denotes  the  pitch  control  input,  MT  is  the 
pitching  moment  caused  by  thrust  and 

za  = —  =  [(9z/3a - gx  -Tx  - X)cos a 
da 

-  (dX/da  - gz  -Tz  -  z)sin  a|a=ao 

Zs e=^T  =  KaZ/35e  )cos  a  “  (3x/35e  )sin  a] 


Ma=^r 


M^, 


Ms  =- 


Neglecting  gravity  and  the  constant  term  Txsinao,  we 
may  write  the  linear  pitch-plane  model 


A  similar  process  results  in  the  following  analogous 
linear  design  model  for  the  yaw-roll  channel: 

[Pi  [(vv  +  sina0)  (Yp/V  +  sina0)  (Yr/V  +  sina0)l 


P  _  fp/lxx  fp /Kx  Fr/^xx 

i\  L  Np/’yy  Np/Iyy  Nr/Iyy  j  (35) 

'  Y */V  Y8r/V  1 

+  L5 a/*xx  L6r/'xx 
h*6a/^yy  ^6r/^yy 

In  the  example  which  follows,  we  will  use  adaptive 
neural  networks  to  augment  a  gain-scheduled  autopilot 
designed  for  the  agile  anti-air  missile  featured  in  Ref.  8 
using  linear  quadratic  regulator  theory  as  described  in 
Ref.  12.  The  design  consists  of  decoupled  pitch  and 
yaw-roll  channels,  and  includes  integral  control  as  well 
as  dependence  on  vehicle  mass  and  Mach  number, 
necessitating  minor  deviations  from  the  technique 
described  above.  The  general  design  philosophy, 
however,  remains  unchanged.  For  simplicity  of 
presentation,  we  will  focus  on  the  longitudinal 
autopilot,  which  has  the  form: 

5c  =  kp(a,M,mXat -a)+ki(a,M,m)j^(ai. -a)Tc  ^ 

+  kd(a,M,m)q 

Note  that  pitch  rate  q  is  used  as  a  state  variable 
rather  than  a ,  since  the  equation  of  motion  governing 


5 

Amqpican  Institute  of  Aeronautics  and  Astronautics 


angle-of-attack  is  not  easily  expressed  in  second-order 
form.  By  virtue  of  the  relationship  between  a  and  q, 
however,  the  gain-scheduled  control  defined  by  Eq.  (29) 
may  be  seen  to  bring  the  closed-loop  system  into  a  form 
similar  to  Eq.  (5). 

a  =  -co^a-2^(ond  +  a)^a.  +  A(a,q,a.)  ^ 

=  fm(a,q,a..)+A(a,q,a  ) 

Assuming  that  the  roll  channel  can  be  regulated 
effectively  by  a  simple  proportional  feedback  control, 
an  implementation  similar  to  the  longitudinal  design 
may  be  used  for  the  yaw  channel. 

Neural  Network  Augmentation 

We  now  introduce  scalar  neural  network 
augmentation  in  yaw,  pitch,  and  roll.  A  feedforward 
neural  network  with  a  single  hidden  layer  is  capable  of 
approximately  reconstructing  the  error  function  (A)  in 
each  channel  and  may  be  used  to  compute  uad(t)  as 
described  previously. 

A  _1 

Uad=g  V 

v  =  vad-v  (38) 

v*  =  wTa(vTx) 

Moreover,  a  stable  learning  rule  is  again  given  by 
Eq.  (27).  Various  neural  network  topologies  and 
choices  of  inputs  were  considered  in  this  study.  The 
results  presented  correspond  to  a  neural  network  with 
only  five  sigmoidal  hidden-layer  neurons.  Each  neuron 
has  an  internal  activation  potential  of  a  =  0.01.  Network 
inputs  were  chosen  to  include  z,  r,  vad  ,  ||z||  ,  and 

Mach  number  along  with  the  customary  bias  term. 
Although  it  does  not  appear  in  Eq.  (16),  Mach  number 
is  included  here  because  its  variation  affects  the 
nonlinear  inversion  error.  This  does  not  alter  the  proof 
of  stability,  since  ter  our  purposes  Mach  number  may 
be  considered  a  bounded,  slowly-varying  parameter. 
The  network  input  (  was  omitted  in  order  to  reduce  the 
size  of  the  network.  The  learning  rates  were  chosen  to 
be  y  =  500  and  Tv  =  500x  I .  The  damping  coefficient  in 
the  weight  update  was  assigned  the  value  A.  =  0.01 ,  and 
the  term  v  was  neglected  for  simplicity. 

Simulation  Results 


Based  on  the  results  obtained  in  Refs  7  and  9,  a 
properly  designed  neural  network  should  achieve 
desired  performance  in  the  face  of  uncertainty.  These 
previous  efforts  describe  implementations  in  which 
neural  networks  were  employed  in  conjunction  with 
crudely  designed  nonlinear  control  laws  derived  from 
approximate  dynamic  inversion.  Since  these  efforts  both 


perform  at  least  as  well  as  the  gain-scheduled  autopilot, 
the  combination  of  neural  network  and  gain-scheduled 
controller  is  expected  to  achieve  similar  results. 

Numerical  simulation  studies  have  been  performed 
using  a  nonlinear,  six-degree-of-freedom  (6-DOF) 
simulation  of  the  agile  anti-air  missile  featured  in 
Ref.  8.  Figure  3  shows  step  responses  with  and  without 
neural  network  augmentation  for  an  angle  of  attack 
command  with  magnitude  90  degrees.  This  comparison 
indicates  that  the  addition  of  the  neural  network  causes 
the  response  to  follow  the  prescribed  first-order  model. 
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Figure  3:  Angle  of  Attack  Response  Comparison 


A  typical  engagement  geometry  for  an  agile  missile 
of  this  type  is  the  “head-on  merge”  scenario  illustrated 
in  Figure  4.  This  engagement  geometry  represents  a 
planar  intercept  with  both  missile  and  target  trajectories 
evolving  primarily  in  the  local  horizontal  plane. 
Although  individual  scalar  neural  networks  were 
implemented,  coupling  effects  were  addressed  by 
including  cross-channel  variables  as  network  inputs. 
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Figure  5:  Angle  of  Attack  and  Sideslip  Response 


Figure  7:  Yaw  RCS  Control  Comparison 


Figure  6:  Pitch  RCS  Control  Comparison  Figure  8:  Roll  RCS  Control  Comparison 


Figure  5  depicts  the  angle  of  attack  and  sideslip 
tracking  response  for  the  gain-scheduled  autopilot  with 
adaptive  neural  augmentation.  The  sideslip  angle 
remains  within  acceptable  values,  but  the  overall 
response  of  the  system  is  somewhat  sluggish.  This  is  an 
inherent  disadvantage  of  adaptive  schemes,  adaptation 
occurs  over  a  small  but  finite  interval.  These  results  are 
quite  favorable,  however,  especially  considering  the  fact 
that  the  design  technique  outlined  above  does  not 
explicitly  address  performance.  More  research  is 
needed  to  fully  characterize  the  interaction  between  the 
nominal  closed-loop  dynamics  associated  with  the  gain- 
scheduled  control  and  the  dynamics  of  the  adaptive 
neural  augmentation  system.  As  expected,  Figures  6-8 
indicate  that  the  results  achieved  necessitated  only 
slight  modification  of  the  RCS  thruster  commands.  This 
is  a  feature  of  the  neural  network  implementation, 
which  does  not  encourage  excessive  control  activity  or 
actuator  saturation. 


Conclusions 

A  control  design  methodology  enabling  the 
integration  of  recently  developed  neural  network-based 
adaptive  control  technology  with  existing  gain- 
scheduled  linear  controllers  has  been  developed.  In  the 
proposed  architecture,  a  neural  network  adaptively 
augments  plant  input  signals  in  order  to  enforce  tracking 
of  a  reference  model.  This  development  was  motivated 
by  previous  research  involving  nonlinear  dynamic 
inversion  controllers,  which  suggested  that  this  design 
technique  could  potentially  improve  both  the 
performance  and  robustness  of  existing  gain-scheduled 
control  laws.  As  an  example,  the  problem  of  designing  a 
longitudinal  missile  autopilot  to  track  commands  in 
angle-of-attack  was  considered.  Nonlinear  6-DOF 
simulation  results  indicated  that  a  neural  network  added 
to  the  longitudinal  channel  of  the  gain-scheduled 
autopilot  resulted  in  improved  tracking  of  a  reference 
model.  This  effect  was  achieved  by  explicitly 
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accounting  for  nonlinear  dynamics  through  an  adaptive 
neural  network  parameterization. 

Efforts  to  implement  a  similar  augmentation  system 
for  the  roll-yaw  autopilot  are  in  progress.  In  addition, 
further  research  is  necessary  in  order  to  extend  this 
control  design  methodology  to  the  more  general  case 
where  the  order  of  the  closed-loop  system  under  the 
action  of  the  original  gain-scheduled  controller  is 
uncertain.  This  may  necessitate  the  use  of  dynamic  or 
recurrent  neural  network  architectures,  which  include 
time-delays  and  are  capable  of  learning  to  approximate 
the  input-output  relationships  of  dynamical  systems. 
These  recurrent  architectures  are  currently  the  subject 
of  ongoing  research  in  which  they  are  being  employed 
as  parameterizations  of  uncertain  dynamical  systems  for 
the  purpose  of  adaptive  control. 
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